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WHAT IS CLAIMED: 

1 . A method for locating data in a data file, comprising: 
5 determining the data unit to be located in the data file; 

determining a type for the data unit, 
when the type of the data unit is not Text, 

10 

selecting a different data unit as the location reference for the data unit; 

generating the location description for the data unit, based on the type of the data unit and the 
position relationship between the data unit and the different data unit, the description including 
1 5 the type of the data unit, one or more location elements for locating the data unit; 

each of the location elements including a combination of attributes so as to determine the 
position of the location element, the attributes including: 

20 a. a base different data unit; 

b. a from position in the base used as the location referring position for the location 
element;; and 

25 c. a skip which represents the offset of the location element from the from position. 

2. The method of claim 1 wherein the type is chosen from the group consisting of Text, 
SingleLine, MultiLine, Block and Iterator 

30 3. The method of claim 1 wherein when generating the location description for the data unit, 
based on the type of the data unit and the position relationship between the data unit and the 
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different data unit, and the description includes the type of the data unit, and at least one 
location elements for locating the data unit and chosen from the group consisting of Top, 
Bottom, Left and Right. 

5 4. The method of claim 1 wherein each of the location elements includes a combination of 
attributes so as to determine the position of the location element, said attributes chosen from the 
group consisting of a base different data unit, a position in the base different data unit and used 
as the location referring position for the location element; and an offset of the location element 
from the base different data unit. 

10 

5. The method for locating data in a data file according to claim 1, characterized in that the 
base different data unit is the data unit having the type "Text" or any data unit the attributes of 
the location elements of which have been determined. 

15 6. The method for locating data in a data file according to claim 1 , characterized in that the 
attributes further include: 

a. an until representing location element stops at a markup; 

20 b. a before representing the offset of the location element stops before a markup; 

c. an after representing the offset of the location element stops after a markup. 

7. The method for locating data in a data file according to claim 1, characterized in that the 

2 5 from is the start position or the end position of the base. 

8. An apparatus for locating data in a data file, comprising: 

a data unit determination unit, a type determination unit, a location reference determination unit 

3 0 and a data unit location description generation unit, 
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said data unit determination unit determining the data unit to be located in the data file; 

said type determination unit determining a type for the data unit, the type including "Text", 
"SingleLine", "MultiLine", "Block" and "Iterator"; 

5 

when the type of the data unit is not "Text", 

a. said location reference determination unit selecting a different data unit as the 
location reference for the data unit; 

10 

b. said data unit location description generation unit generating the location 
description for the data unit, based on the type of the data unit and the position 
relationship between the data unit and the different data unit, the description including 
the type of the data unit, one or more location elements for locating the data unit and 

1 5 including "Top", "Bottom", "Left" and "Right", 

c. each of the location elements including a combination of attributes so as to 
determine the position of the location element, the attributes including: 

20 i. "Base", which is the different data unit; 

ii. "From", which is a position in the "Base" and used as the location referring 
position for the location element; 

25 iii. "Skip", which represents the offset of the location element from the 

"From". 

9. The apparatus for locating data in a data file according to claim 8, characterized in that the 
3 0 "Base" is the data unit having the type of "Text" or any data unit the attributes of the location 
elements of which have been determined. 
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10. The apparatus for locating data in a data file according to claim 8, characterized in that the 
attributes further include: 

5 a. "Until", which represents the location element stops at a markup; 

b. "Before", which represents the offset of the location element stops before a 
markup; and 

10 c. "After", which represents the offset of the location element stops after a markup. 

1 1 . The apparatus for locating data in a data file according to claim 8, characterized in that the 
"From" is the start position or the end position of the "Base". 

15 12. A data transformation method, for transforming data in a first data file having a first format 
into data in a second data file having a second format, the data transformation method 
comprising: 

a. searching the data in the first data file to determine the position of the data ; 

20 

b. extracting the data the position of which has been determined; 

c. transforming the extracted data into data in the second data file, 
2 5 characterized by: 

i. generating the location description for one or more data units in the first data 
file to be located, and building up the correspondence between the data units and the 
second format of the second data file, before the data searching step, 

30 

ii. for each of the data units, performing the steps: 
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iii. determining a type for the data unit, the type including "Text", 
"SingleLine", "MultiLine", "Block" and "Iterator"; 

5 when the type of the data unit is not "Text", 

a. selecting a different data unit as the location reference for the data unit; 

b. generating the location description for the data unit, based on the type of the data 
1 0 unit and the position relationship between the data unit and the different data unit, the 

description including the type of the data unit, one or more location elements for locating the 
data unit and including "Top", "Bottom", "Left" and "Right", 

c. each of the location elements including a combination of attributes so as to 
1 5 determine the position of the location element, the attributes including: 

i. "Base", which is the different data unit; 

ii. "From", which is a position in the "Base" and used as the location referring 
2 0 position for the location element; 

iii. "Skip", which represents the offset of the location element from the 
"From", 

25 d. said data searching step locating the data units based on the location descriptions 

on the one or more data units; 

e. said data extracting step extracting the one or more data units being located; 

30 f. said data transformation step transforming the data units in the first data file 

extracted in said data extracting step into the data having the second format, based on the 
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correspondence build between the one or more data units and the second format, so as to 
generate the data in one or more second data files. 

13. The data transformation method according to claim 12, characterized in that the "Base" is 

5 the data unit having the type of "Text" or any data unit the attributes of the location elements of 
which have been determined. 

14. The data transformation method according to claim 12, characterized in that the attributes 
further include: 

10 

a. "Until", which represents the location element stops at a markup; 

b. "Before", which represents the offset of the location element stops before a 
markup; and 

15 

c. "After", which represents the offset of the location element stops after a markup. 

15. The data transformation method according to claim 12, characterized in that the "From" is 
the start position or the end position of the "Base". 

20 

16. A data transformation apparatus, for transforming data in a first data file having a first 
format into data in a second data file having a second format, the data transformation apparatus 
comprising: 

25 

a. a data searching unit, for searching the data in the first data file to determine the position of 
the data ; 

b. a data extracting unit, for extracting the data the position of which has been determined; 

30 

c. a data transforming unit, for transforming the extracted data into data in the second data 
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characterized in that: said data transformation apparatus further comprises: 

5 a. a data unit determination unit, a type determination unit, a location reference determination 
unit, a data unit location description generation unit and a format mapping unit, 

b. said data unit determination unit determining the data unit to be located in the data file; 

10 c. for each of the data units, performing the steps: 

i.determining a type for the data unit, the type including "Text", "SingleLine", 
"MultiLine", "Block" and "Iterator"; 

1 5 ii. when the type of the data unit is not "Text", 

(a) . selecting a different data unit as the location reference for the data unit; 

(b) . generating the location description for the data unit, based on the type of 
2 0 the data unit and the position relationship between the data unit and the different data 

unit, 

(c) the description including the type of the data unit, one or more location 
elements for locating the data unit and including "Top", "Bottom", "Left" and "Right", 

25 

d. each of the location elements including a combination of attributes so as to determine the 
position of the location element, the attributes including: 

i. "Base", which is the different data unit; 

30 

ii. "From", which is a position in the "Base" and used as the location referring 
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position for the location element; 

iii. "Skip", which represents the offset of the location element from the "From", 

5 e. said format mapping unit building up the correspondence between the one or more data 
units and the second format of the second data file, 

f. said data searching unit searching the data units based on the location descriptions on the 
one or more data units and determining the positions thereof; 

10 

g. said data extracting unit extracting the one or more data units the positions of which have 
been determined; 

h. said data transformation unit transforming the data units in the first data file extracted in 
1 5 said data extracting unit into the data having the second format, based on the correspondence 

build between the one or more data units and the second format, so as to generate the data in 
one or more second data files. 

1 7. The data transformation apparatus according to claim 16, characterized in that the "Base" 
20 is the data unit having the type of "Text" or any data unit the attributes of the location elements 

of which have been determined. 

18. The data transformation apparatus according to claim 16, characterized in that the attributes 
further include: 

25 

a. "Until", which represents the location element stops at a markup; 

b. "Before", which represents the offset of the location element stops before a markup; 
30 c, "After", which represents the offset of the location element stops after a markup. 
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19. The data transformation apparatus according to claim 16, characterized in that the "From" 
is the start position or the end position of the "Base". 
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